A General Bio-inspired Method to Improve the Short-Text Clustering Task

Authors

  • Diego Ingaramo
  • Marcelo Luis Errecalde
  • Paolo Rosso
Abstract

“Short-text clustering” is a very important research field due to the current tendency for people to use very short documents, e.g. blogs, text-messaging and others. In some recent works, new clustering algorithms have been proposed to deal with this difficult problem and novel bio-inspired methods have reported the best results in this area. In this work, a general bio-inspired method based on the AntTree approach is proposed for this task. It takes as input the results obtained by arbitrary clustering algorithms and refines them in different stages. The proposal shows an interesting improvement in the results obtained with different algorithms on several short-text collections.

Similar resources

ITSA*: An Effective Iterative Method for Short-Text Clustering Tasks

The current tendency for people to use very short documents, e.g. blogs, text-messaging, news and others, has produced an increasing interest in automatic processing techniques which are able to deal with documents with these characteristics. In this context, “short-text clustering” is a very important research field where new clustering algorithms have been recently proposed to deal with this ...

Bio-Inspired Techniques in the Clustering of Texts: Synthesis and Comparative Study

Today, the development of a large scale access network internet/intranet has increased the amount of textual information available online/offline, where billions of documents have been created. In the last few years, bio inspired techniques which invaded the world of text-mining such, as clustering, represents a critical problem in the digital society especially over the world of information re...

Hybrid Bio-Inspired Clustering Algorithm for Energy Efficient Wireless Sensor Networks

In order to achieve the sensing, communication and processing tasks of Wireless Sensor Networks, an energy-efficient routing protocol is required to manage the dissipated energy of the network and to minimalize the traffic and the overhead during the data transmission stages. Clustering is the most common technique to balance energy consumption amongst all sensor nodes throughout the network. I...

DESIGNING A CURRICULUM MODEL FOR GENERAL MEDICINE WITH A COMBINED METHOD (E-LEARNING AND NON-E-LEARNING) INSPIRED BY THE AKKER MODEL: A QUALITATIVE STUDY

Background & Aims: In addition to providing health care services, medical universities have an important role in training expert and skilled manpower needed by different sections of society. In order to do so, the general medical education curriculum should be constantly reviewed and improved by eliminating the shortcomings. The aim of this study was to design a curriculum model for teaching ge...

A Bio-inspired Clustering Approach for Dynamic Document Distributed Analysis

Document clustering is a fundamental operation used in unsupervised document organization, automatic topic extraction and information retrieval. But most clustering technologies are limited in their application on the static document collection. Intelligence analysts are currently overwhelmed with tremendous amount of text information streams generated everyday. There is a lack of comprehensive...

Publication date: 2010